CDS

Accession Number TCMCG019C19841
gbkey CDS
Protein Id XP_022949642.1
Location complement(join(2313776..2313847,2314329..2314397,2314482..2314515,2314708..2314785,2314862..2314979,2315272..2315355,2315482..2315532,2315675..2315746,2316136..2316214))
Gene LOC111452971
GeneID 111452971
Organism Cucurbita moschata

Protein

Length 218aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023093874.1
Definition DNA-directed RNA polymerases IV and V subunit 4-like isoform X1 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category K
Description DNA-directed RNA polymerases IV and V subunit
KEGG_TC -
KEGG_Module M00180        [VIEW IN KEGG]
KEGG_Reaction R00435        [VIEW IN KEGG]
R00441        [VIEW IN KEGG]
R00442        [VIEW IN KEGG]
R00443        [VIEW IN KEGG]
KEGG_rclass RC02795        [VIEW IN KEGG]
BRITE br01611        [VIEW IN KEGG]
ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K03012        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00230        [VIEW IN KEGG]
ko00240        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko03020        [VIEW IN KEGG]
ko05016        [VIEW IN KEGG]
ko05169        [VIEW IN KEGG]
map00230        [VIEW IN KEGG]
map00240        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map03020        [VIEW IN KEGG]
map05016        [VIEW IN KEGG]
map05169        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCGGAGATAGGAGAAAAGGGTAATCCACTGCCAAGAAAACCTGGAAAGTCTTCGCTCAAGTCCTCTTTCAAGGATGCTTCTCTAAAAGGAAAGGATGATAGTCTGTTAAAGCCAAAGAAGGGAAGGAAAGTCCAGTTCGATGCTCAAGGATCTGTTGATGCGCAGATTAATTTTTCAATGAAATACAGTGGCAAAAATGGTGACTTGGGTAAAGGAGGAAAAGGCGGAAAAGGCGGAAGTGGTGCGAAGGAACCCCAGCCACTAGAATTGAAGATTGAACAAGAACTTCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGTGAGGCTGCACAAATTTTACAGGGAATCCAAGATCAGATGGTTCTTCTATCAGCAGATCCAACCATAAAAATCCCAACACCATTTGATAGGGGGTTGCAATATGCCAAAAGAGCCAATCACTACGTAAATACCGAGTCAGTTAGACCAGTTCTCGAAACCCTCAAGAAATATGGCGTAATGGACAGTGAGATGTGTGTGCTTGCTAATGTCTGCCCAGACACTACTGATGAAGTTTTTGCTCTTCTTCCATCCTTGAAGAGCAAAAGAAGCAAGCTGAGTGAACCTCTCAACAGTGTCTTGAGAGAGCTAGCCAAGGTAAAATCATCCTGA
Protein:  
MSEIGEKGNPLPRKPGKSSLKSSFKDASLKGKDDSLLKPKKGRKVQFDAQGSVDAQINFSMKYSGKNGDLGKGGKGGKGGSGAKEPQPLELKIEQELPKNVKCQCLMDCEAAQILQGIQDQMVLLSADPTIKIPTPFDRGLQYAKRANHYVNTESVRPVLETLKKYGVMDSEMCVLANVCPDTTDEVFALLPSLKSKRSKLSEPLNSVLRELAKVKSS